Goto

Collaborating Authors

 theoretical statement


We have reformulated the theoretical statements to position 1 them with previous work, clarified notation and surface-level inconsistencies in theoretical statements, added

Neural Information Processing Systems

We thank the reviewers for their constructive feedback. Fu et al. (2020) have added results for A WR, BCQ, REM and AlgaeDICE in R1/[R3, W1]: Policy improvement result, what is CQL doing. Based on R1/R3's requests, we have added a new This follows from applying and extending the tools in Achiam et al. 2017. Note the similarity with Thm. 2 in (Laroche Since our submission, Nair et al. 2020 and Ghasemipour et al. 2020 have discussed We will add extended discussion of this point in the paper. We now explicitly indicate dimensions of vectors, matrices and scalars.


We have reformulated the theoretical statements to position 1 them with previous work, clarified notation and surface-level inconsistencies in theoretical statements, added

Neural Information Processing Systems

We thank the reviewers for their constructive feedback. Fu et al. (2020) have added results for A WR, BCQ, REM and AlgaeDICE in R1/[R3, W1]: Policy improvement result, what is CQL doing. Based on R1/R3's requests, we have added a new This follows from applying and extending the tools in Achiam et al. 2017. Note the similarity with Thm. 2 in (Laroche Since our submission, Nair et al. 2020 and Ghasemipour et al. 2020 have discussed We will add extended discussion of this point in the paper. We now explicitly indicate dimensions of vectors, matrices and scalars.